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The  pro!  lem  of  statistical  decision  making  under  uncertainty  is  considered.  A  Bayes 
approach  based  upon  prior  probabilities  which  are  found  using  an  objective  inference 
technique  developed  by  R.  L.  Kashyap  is  proposed  as  the  basic  solution  procedure. 

The  problem  is  formulated  in  a  statistical  decision  theory  format  and  the  general 
solution  technique  outlined.  Several  examples  are  solved  to  illustrate  its 
application . 


Using  this  inference  technique,  it  is  possible  to  have  different  priors  for 
different  experiments.  A  general  decision  criterion  is  formulated  to  handle 
these  situations. 
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BAYES  DECISION  RULES  BASED  ON  OBJECTIVE  PRIORS 


PART  Is  FO EMULATION  AND  APPLICATION 
By  T.  L.  Oberlinand  R.  L.  Rqshyap 

Abstract 

The  problem  of  statistical  decision  making  under  uncertainty  is 
considered.  A  Bayes  approach  based  up pjx  prior  probabilities  which  are 
found  using  an  objective  inference  technique  developed  by  R.  L.  Kashyap 
is  proposed  as  the  basic  solution  procedure.  The  problem  is  formulated 
in  a  statistical  decision  theory  format  and  the  general  solution  tech¬ 
nique  outlined.  Several  examples  are  solved  to  illustrate  its  applica¬ 
tion. 

Using  this  inference  technique,  it  is  possible  to  have  different 
priors  for  different  experiments .  A  general  decision  criterion  is  formu¬ 
lated  to  handle  these  situations.  .  . 
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INTRODUCTION 


A  good  deal  of  current  research  is  directed  at  systems  having  varying 
degrees  of  uncertainty.  A  common  method  of  handling  these  uncertainties 
Is  to  assume  they  can  be  characterized  by  random  variables  which  are  then 
assigned  probability  distributions. 

Basically  there  are  two  schools  of  thought  concerning  this  assign¬ 
ment  prc  edure— the  subjective  and  the  objective.  Using  the  subjective 

•  ••  -I 

approach  as  outlined  by  Savage  ,  the  decision  maker  attempts  to  fora  a 

consistent  set  of  preferences  from  which  he  obtains  his  prior  distribution; 

note  chat  two  decision  makers  with  the  same  information  may  have  different 

priors.  On  the  other  h&ud,  the  objective  approach  attempts  to  present 

necessary  or  logical  means  of  obtaining  the  priors.  Using  thiB  approach > 

two  decision  makers  having  the  same  information  will  have  the  same  priors, 

hence  the  term  objective.  Proponents  of  objective  approaches  range  from 

2  3 

Bernoulli  (principle  of  insufficient  reason)  to  Jeffreys  ,  Jaynes  , 

Tribus\  and  Kashyapf s ° 5 ^ ® 

In  this  paper  a  new  appi-oach  to  the  problem  of  statistical  decision 
making  under  uncertainty  is  presented.  It  uaes  Bayes  decision  rules  based 
on  prior  probabilities  found  by  applying  the  objective  inference  technique 
developed  by  Kashyap.  The  problem  is  formulated  in  a  statistical  decision 
theory  format  and  the  general  solution  technique  outlined.  Several  examples 
are  solved  to  illustrate  its  application. 
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The  major  emphasis  in  this  part  of  the  paper  is  on  the  formulation 
and  use  of  the  decision-making  technique.  Questions  of  justification 
are  postponed  until  Part  II. 

The  major  result  of  this  paper  is  the.  formulation  of  an  objective 
Bayes  approach  to  the  problem  of  decision  making  under  uncertainty.  This 
study  also  points  out  the  difficulties  associated  with  using  any  inference 
technique  whose  inference  varies  with  the  experiement.  A  general  approach 
which  permits  different  prior  probabilities  is  formulated  to  handle  these 


situations . 


II.  STATEMENT  OF  THE  DECISION  MAKING  PROBLEM 


A.  BASIC  ELEMENTS 

Assuming  that  the  uncertainty  in  the  decision  problem  can  be  re¬ 
presented  by  certain  random  variables — so  called  states  of  nature — the 
basi't  elements  of  the  decision  problem  under  uncertainty  are  as  follows: 

0:  unknown  state  of  nature,  0  e  0,  set  of  possible  states 
of  nature. 

•  .  t  ,  ( 

a:  actions  available  to  decision  maker,  a  e  A,  set  of 

'i 

potential  actions. 

,  t  .  „ 

V*  ' 

L(6,a):  loss  function,  L(0,a)  is  a  real- valued  function  defined 

0  x  a. 

e:  experiment,  e  e  E,  set  of  potential  experiments, 
y:  outcome  of  the  experiment,  y  e  Y  ,  set  of  possible  out¬ 
comes  of  an  experiment  e. 

d(y) :  decision  rule,  d(y)  selects  an  action  from  A  for  every 
possible  outcome  y. 

The  observations  y  are  dependent  on  0  through  the  conditional  prob¬ 
abilities  P(y|0),  which  are  assumed  to  be  known  for  all  y  £  Yg,  0  c  0. 
P(y|0)  is  assumed  to  have  a  fixed  set  of  values  throughout  the  experiment} 
if  it  changes,  so  does  the  experiment. 

For  certain  experiments  it  will  be  convenient  to  refer  to  a  sequence 

'  '  '  "  1  »  .  ,  > 

of  independent  observations  which  will  be  denoted  by  y(l)  ,y(2)  , .  ,y(m+l) . 
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Unless  otherwise  noted  these  variables  will  have  the  same  distribution 
as  y. 

There  also  may  be  additional,  testable  prior  information  about  6 

other  than  its  domain  0.  Information  about  0  is  testable  if,  given  any 

probability  assignment  for  0,  it  is  (potentially)  possible  to  verify  whether 

3 

or  not  this  probability  agrees  with  the  information  .  For  example,  it  may 
be  known  that  the  average  value  of  0  is  greater  than  a  certain  given 
number  or  that  the  probability  of  a  particular  outcome  Is  less  than  a 
given  value.  For  the  latter  case,  since  P'(y|0)  is  known,  this  implies 
a  restriction  on  the  allowable  probability  distributions  of  0. 

Other  types  of  loss  functions  considered  are  those  which  are  functions 
of  an  outcome  of  the  experiment  and  the  action  selected  prior  to  the  ob¬ 
servance  of  this  outcome.  Their  form  is  simply  I,(y,a).  However,  in  order 
to  emphasize  the  difference  in  loss  functions — which  is  essential  to  our 
approach,  as  we  shall  soon  sefe — we  shall  use  z  instead  of  y  to  denote  the 
observation  when  loss  functions  L(z»a)  are  considered. 

Both  of  these  types  of  loss  functions— L (6, a)  ,  L(z,a) — shall  be  con¬ 
sidered  in  this  paper.  In  the  former  0  is  the  variable  of  interest  and 
in  the  latter  z  is  the  variable  of  interest.  In  order  to  account  for 
different  experiments,  observations,  and  decision  rules,  more  notationaily 
convenient  forms  for  these  loss  functions  will  be  used.  If  d  represents 

a  decision  rule  for  experiment  e,  then  these  loss  functions  can  be  represented 

9 

by  L(0,d  )  or  L(z,d  )  (see  Ferguson  ). 

6  c 


Primary  consideration  in  this  paper  will  be  on  the  selection  of  . 
single  decision  rules,  d&,  and  not  on  selection  of  sequences  of  decision, 
rules. 

B.  DEFINITIONS  OF  BAYES  DECISION  RULE,  RESTRICTED  BAYES  DECISION  RULE 

The  decision  making  problem  is  essentially  the  selection  of  decision 
* 

rules,  d^_ ,  which  for  a  given  experiment  e  minimize  the  expected  value  of 

the  loss.  Then  the  experiment  which  results  in  the  minimum  expected  loss 

* 

is  selected  as  the  optimal  experiment  e  .  (More  will  be  said  on  this 
in  Sec.  VI.) 


For  a  given  probability  distribution  P  (0) ,  the  risk  r(P  ,d  )  is 
given  by 


r(pe-de)  ■  EeR(9,de>  *  Ee[Ey| 0L(Me)! 

or  by 

r(P  ,d  )  *  EflR(8,d  )  »  Eft[E  ,ftL(z,d  )J  (2-2) 

ee  u  e  Uz|o  e 

A  decision  rule,  dg,  which  minimizes  r(?e,d£)  is  called  a  Bayes 

decision  rule  with  respect  to  P^(9).  That  is, 

d  «-Arg  Min  r(P  ,d)  (2-3) 

"  d  ~  6 

If  a  decision  rule,  X  ,  minimizes  r(P  ,X  )  subject  to  the  restriction 
’  e  e  e  J 

that 

Max  R(0,X*>  <  C  (2-4'  ' 

a  e 


5 


where  0  e  0  and  C  is  some  specified  constant,  then  is  called  a 
restricted  Bayes  rule  with  respect  to  end  C  Note  that  Eqs.  2-3 
2-4  are  given  for  a  specific  experiment  e. 


6 


III.  GENERAL  SOLUTION  PROCEDURE 


The  crux  of  the  problem  in  forming  Bayes  decision  rules  is  the 

*  i  1 ' 

selection  of  the  prior  probability  density  for  the  unknown  state  of  r.ature 

0. 


A.  PRIOR  PROBABILITY  ASSIGNMENT 

In  this  paper  the  aseigiwent  problem  will  be  treated  as  an  inference 
problem,  and  an  objective  inference  technique  developed  by  Kashyap  5,  6,  ?,  8 
will  be  primarily  used. 


Following  this  procedure,  when  z  is  the  variable  of  Interest  and 
for'finite  0,  z  and  a  given  experiment  e,  the  prior  density  Pe(0)  is 
given  by 

P  (9)  ®  Arg  Max  I(z;6) 

6  P(0)  (3-1) 


where 

P(z  10.) 

I(z;9)  «l  l  P(Z;j  |ei)P(0±) 


and 


p » I  Ptejlv  p(0k) 

®k 


(3-2) 


(3-3) 


In  the  maximization  in  Eq.  3-1,  P(0)  is  subject  to  the  restrictions 


I 

e 


p(0i)  »  i 


(3-4) 
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(3-5) 


PC©i)  >  0  f  0±  e  0 


and  whatever  testable  information  on  0  is  available 


In  Ref.  6  Kashyap  has  developed  a  version  of  Eq.  3-1  which  allows 
for  previous  observations  to  be  taken  into  account.  If  0,  z  are  finite 
and  there  are  m  independent  observations  from  z — z(m+l)  is  the  variable 

of  Interest  now — let  Zm  denote  a  finite  valued  statistic  of  these  observa- 

. 

tions . 


Zm  »  <f)[z(l)  ,z(2) , .  1  ,z(m)  ] 

.  m  .  .  s 

* 

Then  the  inferred  prior  for  a  given  experiment  em  is 

Pe  (®)  83  Arg  Max  I(z(m+1)  ;0  |zm) 

P(9) 

where 


(3-6) 


(3-7) 


(3-8) 


(3-9) 
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P(iSmJ 0)  is  found  from  P(z| 0)  and  the  assumption  that  z(l),  z(2) , . . . ,z(m) 
are  independent.  P(0)  is  again  subject  to  restrictions  imposed  by  Eqs. 

3-4  and  3-5  and  the  testable  prior  information.  When  m  «  0,  Eq.  3-7  just 
reduces  to  Eq.  3-1.  It  should  be  noted  that  if  P(z|0)  is  altered,  the 
prior  probabilities  P£(0) ,  Pg^CQ)  given  by  Eqs.  3-1  and  3-7  will  be 
altered. 

When  6  is  the  variable  of  interest,  the  Principle  of  Maximum  Entropy 

3 

developed  by  Jaynes  will  be  used.  According  to  this  principle,  for  a 
given  experiment  e,  the  prior  Pe(0)  which  maximizes  the  entropy  of  0, 
I(0;0)  is  selected  as  the  inferred  prior.  For  finite  0,  this  assignment 
is  given  by 

P  (0)  «»  Arg  Max  I(0;8) 

8  P(0)  (3-10) 

* 

where 

1(9.0)  -  -  l  P(0i)  In  P(0±) 

9i  (3-11) 

where  again  P(0)  is  subject  to  Eqs.  3-4  and  3-5  and  whatever  restrictions 
imposed  by  testable  prior  information.  Note  the  independence  of  P&(0) 
from  P(y | 0) . 

A  comparison  of  the  fundamental  differences  between  these  inference 
techniques— Kashyap  ’  s  inference  and  Jaynes'  Maximum  Entropy — is  given  in 
Ref.  7. 
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For 


Equations  3-1,  3-7,  and  3-10  are  given  for  finite  0,Y. 
continuous  0,Y,  the  summations  can  be  replaced  by  integrals,  and  assuming 
these  integrals  exist,  the  same  approach  followed. 

B.  REASONABLENESS  OF  INFERENCE 

One  result  of  using  these  probability  assignment  procedures  is  that 
when  the  outcome  of  the  experiment  is  also  the  variable  of  interest,  the 
inferred  density  is  a  function  of  the  experiment. <  Thus,  for  different 
experiments  the  densities  Pe(9)  will  generally  differ.  This  is  not 
unreasonable  since  the  amount  of  information  available  (P(z}9)  is  a  form 
of  knowledge)  varies  with  the  experiments.  The  implications  of  this  will 
be  discussed  in  Sec.  VI. 

Kashyap  has  Justified  his  approach  to  the  inference  problem  by  show¬ 
ing  it  is  the  solution  of  a  repetitive  sero-sua  game.  Thus,  if  there  is 
an  unknown,  but  true,  frequency  distribution  for  a,  any  deviation  from  the 
optimal  inferred  density  by  the  player  can  only  increase  his  maximum 
possible  Iosj .  In  addition,  in  Ref.  8  Kashyap  has  shown  that  under 
certain  conditions  his  inference  technique  minimizes  the  maximum  possible 
divergence  between  the  unknown  frequency  distribution  of  z  and  that 
obtained  using  P(s|9)  and  the  objective  prior  given  by  Eq.  3-1.  In  a 
sense  then  this  is  a  conservative  inference  procedure. 
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c. 


GENERAL  SOLUTION 


We  shall  use  the  term  objective  Bayes  decision  rules  to  denote  those 
decision  rules  which,  are  Bayes  with  respect  to  priors  formed  by  using  the 
objective  inference  techniques  outlined  in  the  previous  section.  These 
objective  Bayes  decision  rules  then  follow  directly  from  the  definitions 
in  Eqs.  2-3  and  2-4  and  the  inferred  priors. 

The  final  step  in  solving  the  general  decision  making  problem  under 
uncertainty  is  the  selection  of  the  experiment  which  results  in  the  minimum 
risk  for  a  glv-in  loss  function.  For  the  inference  technique  used  here 
this  necessitates  comparing  risks  based  on  different  priors .  The  validity 
of  these  comparisons  is  discussed  in  Sec.  VI. 


n 


IV.  EXAMPLE  1  -  URN  PROBLEM 


.  ,  ,  This  problem  has.  been  considered  previously  by  Raiffa^.  We  will 

modify  it.  here  by  considering  losses  instead  of  gains  and  using  an  objective 
approach. 

,  •  ;  l '  .  ....  "V 

A.  STATEMENT  OF  THE  URN  PROBLEM 

The  basic  problem  is  as  follows.  There  is  a  collection  of  1000  urns, 

each  qf  v/hichis  either  the  type  0^  or  type  but  there  are  no  external 

markings  distinguishing  them.  Urns  of  type  0^  contain  4  red  balls  and  6 

black  balls;,. .urns  of  type  0 ^  contain  9  red  balls  and'l  black  ball. 

.  ,  >  ♦ 

•  , •  . .  •  t*  :  .. 

There  are  two  participants — a  decision  maker  and  a  neutral  referee . 

The  decision  maker  is  to  repeatedly  draw  one  of  the  urns  from  the  collection 
and  guess  whether  it  is  of  type  0^  or  0£.  He  can  also  refuse  to  play. 
Depending  upon  his  action  and  upon  the  type  of  urn  (which  is  known  to  the 
referee),  he  receives  a  reward  or  pays  a  penalty.  He  wishes  to  choose  his 
actions  so  that  ho  minimizes  his  average  loss  over  all  the  draws.  After 
incurring  the  loss  the  decision  maker  returns  the  urn  to  the  collection. 

It  is  assumed  that  the  number  of  0^,  urns  the  collection  does  not 
vary  so  that  there  is  an  underlying  (frequency)  distribution  for  0,  albeit 
unknown. 

Therefore,  let  the  state  space  0^,  and  action  space  Au  be 


Ai  “  {aul,au2‘au3} 


(4-2) 


where 

\ll  guess  0 
a^:  guess  @2 
au2:  refuse  to  play 
The  losses  are  given  in  Table  4-1. 


TABLE  4-1 

LOSS  MATRIX  FOR  URN  PROBLEM 

aul _ au2  au3 

61  -40  5  0 

e2  20  -100  0 


Consider  two  possible  experiments 

e  :  no  observations  at  cost  $0.00 
uo 

e^:  a  single  observation,  y,  at  cost  $8.00 
let  y^  *  red  ball;  y2  =  black  ball.  Then 

P(y1le1>  =  *4  PCyl*  62)  "  *9  (4-3) 

P(y2|e1)  =  .6  PCy2le2)  =  -1 

where  an  observation  y  refers  to  withdrawing  a  ball  from  the  chosen  urn 

and  observing  its  color. 


13 


B.  SOLUTION 

Following  the  procedure  outlined  in  the  preceding  section,  the 
objective  Bayes  rules  deuQ,  ^or  eu0>  can  easily  be 

found* 


Since  the  loss  functions  are  of  the  form  L(9,d),  the  principle  of 
Maximum  Entropy  will  be  used  to  infer  Peuo(6)>  Peui^*  If  there  are  rr. 
restrictions  other  than 


I  PO.)  -  1  ,  P(9.)  >  0 

i=l 


i=l,2 


(4-4) 


then  by  the  Maximum  Entropy  Principle 
Peuo<V  "  Peuo<G2>  =  *5 

and 


(4-5) 


Peui(ei)  =  .5  ,  peul<V  »  -5 


(4-6) 


*  * 

In  order  to  find  dg^,  d6u^,  is  is  perhaps  easiest  to  enumerate  the 
candidates  or  allowable  strategies.  Denote  them  by  a.  Then 


a  :  refuse  to  play 
uo 


C  , :  choose  a  , 
ul  ul 


a  choose  a  .. 
u2  u2 


0  if  y  =  red,  choose  a 
u3  J  ul 
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:  if  y  =  black, choose  a 


cr  . :  "  "  red 

U4 

:  "  "  black 

a  c:  "  "  red 

u5 

:  "  "  black 

a  "  "  red 

u6 

:  "  "  blaclc 


(.4-7) 


* 

Thus,  a  ,  or  , .  a  .  are  the  candidates  for  d_.  ;  a  a  , ,  a  _ ,  a  ..  are 
uo’  ul’  u2  &Uo  u3  u4*  u5*  u6 

* 

the  candidates  for  d~  . 

eul 


Following  Sec.  Ill,  it  is  easily  seen  that 


and  their  corresponding  risks  are 

r(P  ,d*  )  =  .5(5)  +  .5 (-100)  =  -47.5 
uo  do 

r(P  »<£  )  =  .5 (-14)  +  .5 (-80)  =  -47 
ul  ul 


(4-8) 

(4-9) 

(4-10) 


The  next  step  in  the  decision  making  process  is  the  selection  of 
the  best  experiment.  While  in  this  problem  this  selection  is  straight¬ 
forward,  it  is  not  always  so,  as  *-;e  shall  see  in  the  next  section. 
Accordingly,  we  shall  postpone  selecting  the  "best"  experiment  until 


Sec.  VI. 


V. 


EXAMPLE  2  -  BALL  PROBLEM 


A.  STATEMENT  OF  THE  BALL  PROBLEM 

The  ball  problem  is  similar  to  the  urn  problem  but  there  are  some 
basic  differences.  Again  there  is  a  collection  of  1000  urns,  each  of 
which  is  of  type  0^  or  @2,  having  the  same  characteristics  as  in  the  urn 
problem.  Again  there  are  two  participants — a  decision  maker  and  a  neutral 
referee.  However,  now  the  decision  maker  is  to  repeatedly  draw  one  of  the 
urns  from  the  collection  and  then  withdraw  a  ball  from  this  selected  urn. 
Without  knowing  the, type  of  urn,  he  is  to  guess  the  color  of  this  ball. 
Depending  upon  his  action  and  upon  the  color  of  the  ball,  he  received  a 
reward  or  pays  a  penalty.  Again  he  wishes  to  choose  his  actions  so  that 
he  minimizes  his  average  loss  over  all  the  draws.  After  incurring  the 
loss,  the  decision  maker  returns  the  urn  to  the  collection. 


Let 


where 


the  state  space  0^  and  action  space  A^  be 

®b  *  !61-V 

(5-1) 

\  "  {abl’ab2’ab3} 

(5-2) 

a^^5  guess  red 

a^2:  gu^ss  black 

- 

<3o  uot  play 
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If  2  denotes  the  observation  of  the  ball  which  determines  the  loss 


incurred,  then  the  loss  function  L(2,a^)  is  given  in  Table  3-2,  where  z^  =  red, 
3  black. 


TABLE  5-1 

LOSS  MATRIX  FOR  BALL  PROBLEM 

%1  *b2  *b3 

z^  -40  5  0 

z?  20  -100  0 


Now  consider  two  experiments  e^,  e^.  With  e^  there  is  only 

prior  information  on  the  collection  of  urns  and  the  conditional  probabilities 

P(z|9),  which  are  the  same  as  those  given  for  y  in  Eq.  4-3,  upon  which  to 

base  the  decision.  That  is,  under  e,  the  decision  maker  withdraws  an  urn 

bo 

from  the  collection,  selects  an  action,  a.Q  e  A^,  and  then  withdraws  a  ball  z 
from  this  urn.  However,  under  e^  let  the  decision  maker  withdraw  a  ball 
from  the  selected  urn  and  observe  its  color  before  making  a  decision. 

Call  this  observation  z(l).  Then  have  him  replace  this  ball,  select  an 
action  e  A^,  withdraw  another  ball,  z(2)  =  z,  and  incur  the  loss  LCz.a^). 
Let  the  cost  of  this  extra  observation  be  $3.00. 

Thus,  there  are  two  possible  experiments, 
e^:  no  observations,  at  cost  $0.00 
e^:  one  observation,  z(l),  at  cost  $3.00 


mj 


sir.  •***  ^MMX~nnA*~~* 


Ci 


Let  2^  «  red,  ^  ~  b*ack.  Th®3*  PCs^l^),  .1*1,2;  i=l,2;  1=1,2,  is  the 
same  as  that  given  in  Eq.  4-3  for  y. 


Be  SOLUTION 

Section  III  outlines  the  solution  technique.  P  (0)  and  P  (8) 

%o  %1 

are  given  by  Eqs.  3-1  and  3-7,  respectively.  If  there  are  no  restrictions 
other  than 


I  : 


then 


l  \  <ei>  -  1  i 
i-1  %3 


Pa  (0,)  >  0  ,  1=1,2;  j=l,2  (5-3) 


P  (8l>  3  .465  ; 

DO 


Pp  (0J  -  .535 
“bo 


(5-4) 


and 


Po  (0  )  =  .505  ; 
ebl  1 


P  (6  )  «  .495 
bl  * 


(5-5) 


(Comparison  with  Eqs.  4-5  and  4-6  illustrates  the  effect  of  the  variable 
of  interest  upon  the  inferred  prior  for  0.) 


Following  the  same  procedure  as  in  the  urn  problem,  wv  will  enumerate 
the  allowable  strategies  for  experiments  e^,  e^. 
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o^:  refuse  to  play 


ffbl;  choose 


^1 


%2:  choose 


^2 


if 

2(1) 

=  red. 

choose 

^1 

• 

II 

11 

hlack. 

II 

aw 

It 

II 

red 

II 

%1 

• 

• 

tr 

If 

black. 

II 

%2 

°b58 

ti 

II 

red 

It 

%2 

• 

• 

tr 

It 

black 

II 

*hl 

ab6: 

it 

If 

red 

II 

*b2 

• 

* 

it 

II 

black 

II 

®b2 

°bo> 

abi» 

CM 

t>° 

are  the 

candidates 

e  the  candidates  for  d 

ebl 


ebo’  ub3’  b4’  '  b5 ’  °b6 


Following  Sec.  Ill,  it  is  easily  seen  that 


d  (z(i))  =  a 
ebl  b4 


(5-6) 


d  their  corresponding  risks  are 


r(Pe,  ,do  >  =  "58 (.465)  -  5.5(.535)  »  -29.91 

DO  H)0 

r(P  ,d*  )  *  -33.4C.505)  -28.15 (.495)  -  -30.80 
bl  ®bl 


(5-7) 

(5-8) 
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The  selection  of  the  "best"  experiment  is  not  as  straightforward 
as  in  the  urn  problem — since  the  risks  in  Eqs.  5-7  and  5-8  are  based  on 
different  priors ,  how  valid  is  a  comparison  between  them  for  use  in 

selecting  the  best  experiment?  We  consider  this  question  in  the  next  / 


section. 


VI.  SELECTION  OF  EXPERIMENT 


The  previous  sections  nave  been  directed  mainly  at  the  selection 
of  the  "best"  decision  rule  for  a  given  experiment,  where  the  conditional 
probability  P(y|0)  has  been  fixed  throughout  the  experiment. 

However,  the  final  step  in  the  overall  decision  making  problem  is 
the  selection  of  the  optimal  experiment  from  a  set  of  possible  experiments. 


A.  BASIC  PROBLEM 

Quite  simply,  the  basic  problem  confronting  us  is  the  selection  of 
a  criterion  for  choosing  the  optimal  experiment.  The  criterion  for  sele¬ 
ction  of  the  decision  rule  for  a  given  experiment  was  to  choose  the 

“fc 

decision  rule,  de>  which  minimized  the  risk  r(Pg,de)  where  Pg  was  given 
by  our  objective  inference  technique.  Notice,  however,  that  this  risk 
is  a  function  of  Pg  and  that  for  loss  functions  of  the  type  L(z,a),  Pg 
may  depend  on  the  experiment  under  consideration.  Accordingly,  sole 
use  of  r(-,-)  as  a  criterion — that  is,  choose  experiment  e^  over  e^  if 
and  only  if 


*0?  »<*!  >  <  r(P„  ) 

i  i  j  w3 


(6-1) 


is  difficult  to  justifj  since  different  probability  distributions  for  0 
are  used  in  evaluating  the  respective  risks.  If  P  ,  ?  ere  identical 

Si  6j  . " 

then  Eq.  6-1  can  easily  be  justified  as  an  ordering  using  tne  arguments 
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presented  in  Part  II.  When  these  priors  differ,  Eq.  6-1  must  be  used 
with  more  care.  Consider  the  following  example. 


1.  Ball  Problem — Choose  Between  e,  ,  e,  , 

In  the  previous  section  experiments  e.  ,  e^  were  introduced  in 
the  ball  problem.  The  question  now  is,  which  experiment  should  be  used? 
(Note  that  experiment  e^  has  a  $3  cost  while  e^  has  no  extra  cost. 
Hence  the  question  could  be  phrased  as — is  e^  worth  $3  more  than  e^  ?) 


The  respective  risks  for  the  two  experiments  are 

■  i  i 


:<P  ,d  =  o  )  =  -58(.465)-5.5(.  535)  =  -  29.91  (6-2) 

ebo  ebo 


r(P  ,d  =  Cf)  =-33. 4(.505)-28.15(. 495)  =  -30.80  (6-3) 

ebl  %1  04 


However,  these  risks  do  .;oL  tel),  the  whole  story.  Consider  r^.a^)  and 
rCpvCT^)  as  functions  of  p,  the  true  probability  that  0  =  0^. 


r(p,CTb2)  =  "5.5  -  52.5  p 


(6-4) 


=  -28.15  -  5.25  p 


(6-5) 
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These  risks  are  plotted  versus  p  in  Fig.  6-1.  They  intersect  at  about 


p  =  .480,  Thus,  when 
average  loss  than  0^, 


rCPebi’ab2 


p  >  .480  (p  <  .480),  0^2  results  in  less  (greater) 

so  if  p  actually  did  equal  P  (0  )  (=.505) 

l 


)  < 


r(P  ,0..) 


(6-6) 


However,  from  Eq3.  6-2  and  6-3, 


,0b2)  >  r(P^^,0b4) 


(6-7) 


Thus,  we  have  a  situation  in  which  we  prefer  0^^  to  0.  £  if  we  evaluate 

their  respective  risks  based  on  P  ,  P  , respectively;  but  if  we  compare 

*bl  ®bo  ' 

risks  when  both  are  evaluated  at  p  =  P£  (0^),  we  prefer  0,  ~  to  °t4* 

hi  ^  jc 

Obviously,  factors  other  than  just  r(P  ,d  )  and  r(P  ,d  )  should 

%o  ^o  %1  'ill 

be  taken  into  consideration. 


B.  QUASI-BAYES  APPROACH 

It  seems  that  so  long  as  our  objective  inference  technique  infers 
different  priors  for  different  experiments,  our  final  selection  of  experi¬ 
ment  must  remain  somewhat  arbitrary.  In  order  to  formalize  these  arbitrary 
considerations  somewhat,  we  shall  introduce  the  quasi-Bayes  approach. 

We  assume  that  there  is  a  fixed  set  of  experiments,  E  =  {e^,e2> . . • je^) 
from  which  we  wish  to  select  an  experiment  and  decision  rule.  We  shall 
call  each  decision  rule  and  its  associated  experiment  a  strategy  and  let 
ir^,  i=l,  2,...,m,  denote  the  admissible  strategies  formed  by  the  experiment 
in  E  and  their  corresponding  decision  rules.  (Note  that  a  decision  rule 
d^  might  be  admissible  relative  to  the  other  decision  rules  in  a  given 
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experiment  e  e  E,  but  the  strategy  (e-d^)  might  be  inadmissible  relative 
to  the  strategies  it^,  i=l,2, . . . ,m.)  Let  p  be  a  probability  distribution 
over  the  it’s;  that  is,  p  is  our  randomized  choice  of  decision  rule  and 
experiment.  We  will  restrict  p  to  those  mixtures  of  it's  which  are  them¬ 
selves  admissible  strategies. 

Now  if  the  true  distribution  for  6 — P(9) — were  known,  the  risk  r(P,p) 

could  be  evaluated  and  p  chosen  to  minimize  it.  For  example,  if  p^,  p^  p^ 

denote  the  probabilities  that  p  selects  decision  rules  d  ,  d  ,  d  from 

el  e2  e3 

experiments  e^,  e^,  respectively,  then 


r(P,p)  ■  p  r(P,d  )  +  p.r(P,d  )  +  p  r(P,d  ) 
1  el  L  e2  J  e3 


(6-8) 


However,  not  only  is  the  true  distribution  of  0  unknown,  but  our 

objective  inference  technique  may  assign  a  different  prior  distribution, 

Pe(8),  to  each  experiment  c,  Using  this  technique,  no  corresponding  r(Pe>p) 

exists  since  Pg  varies  with  the  selections  of  p.  Let  i  denote  a  mapping 

which  relates  the  decision  rule  selected  using  p  with  its  associated 

objective  probability.  Then  define  r(T,p)  analogously  to  r(Pe,p);  note, 

however,  that  T  is  not  a  probability  density.  For  example,  if  p^,  p^>  P3 

are  as  defined  in  Eq.  6-8  and  d  ,  d  ,  d  are  decision  rules  from  experi- 

el  e2  e3 

ments  e^,  e^,  e..,  then 

r(i,p)  =  p.rCP^  ,d  )  +  P,r(P  >d  )  +  p~r(P  ,d  )  (6-9) 

1  ~1  el  *  e2  e2  J  e3  3 


Define  p  to  be  ?  quasi-Bayes  decision  rule  with  respect  to  x  if 
* 


p  =  Arg  Min  r(x,p) 
P 


(6-10) 
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were  naturally  p  is  subject  to  the  conditions  necessary  for  it  to  be  a 
probability  distribution. 

If  there  are  no  further  restrictions  on  p  or  r(T,p)  and  there  is  one 
decision  rule-experiment  combination  whose  risk  is  less  than  any  other 
risk,  then  p  will  select  this  decision  rule-experiment  rith  probability 
one.  That  is,  the  unrestricted  quasi-Bayes  procedure  just  reduces  to: 
select  the  decision  rule  dg  and  experiment  e  having,  the  least  risk  r(P£,de). 

However,  if  a  restriction  such  as 

R(9,p*)  <C  V  6e0  (6-11) 

* 

if  desired,  then  p  will  not  necessarily  be  a  degenerate  probability  dis¬ 
tribution.  We  will  denote  the  decision  rules  satisfying  Eqs.  6-10  and 
6-11  by  A  and  call  them  restricted  quasi-Bayes  decision  rules. 

It  is  this  choice  of  C  which  allows  the  decision  maker  to  formalize 
his  arbitrary  considerations .  A  C  close  to  the  minimax  risk  indicates 
little  confidence  in  the  accuracy  of  the  inferred  priors,  whereas  a  C 
close  to  the  maximum  risk  of  the  unrestricted  Bayes  rule  indicates  con¬ 
fidence  that  the  inferred  and  actual  distributions  are  "close." 

This  approach,  therefore,  has  some  of  the  advantages  of  both  the 
minimax  and  unrestricted  Bayes  decision  rules.  On  one  hand  it  allows  the 
decision  maker  to  restrict  his  maximum  possible  loss  to  some  level  "C,  but 
on  the  other  hand,  since  Pg  is  used  in  evaluating  the  risk  that  A  minimizes, 
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ic  isn't  so  pessimistic  that  it  concentrates  unduly  (as  determined  by  C) 
on  the  worst  states  of  nature.  An  example  illustrating  this  behavior  is 
given  in  the  next  section. 


C.  EXAMPLE  3  -  MODIFIED  BALL  PROBLEM 

The  ball  problem  we  will  consider  here  has  one  slight  modification 
from  that  presented  in  Secs.  V  and  VI-A-instead  of  a  $3  observation  cost 
for  experiment  e^,  we  will  assume  a  $6  observation  cost.  This  modifica¬ 
tion  will  give  a  more  interesting  solution  to  the  problem. 


Due  to  this  modification  in  experimental  costs,  the  unrestricted 
quasi-Bayes  rule  selects  experiment  e.  and  decision  rule  cr^*  That  is, 
Eq.  6-2  is  unchanged,  but  Eq.  6-3  is  changed  to 


r(P  ,d  «  a,.)  *  -30.80  +  3  =  -27.80 

Si  Si  b4 


(6-12) 


and  const 


sequently  r(P  >^2^  *s  ^esS  t*ian  ^is  new  risk  s0  *s 

Cbo  '  " 

the  unrestricted  quasi-Bayes  rule. 


If  we  wish  to  restrict  the  maximum  possible  risk  to  less  than  some 
specified  constant  C,  it  is  then  necessary  to  randomize  over  experiments. 
Of  the  set  of  strategies  available,  only  0^,  ^2*  snd  are  admissible. 
If  A^,  A^,  A^  represent  the  probabilities  that  the  restricted  quasi-Bayes 
rule  A  selects  0'^,  and  respectively  (and  their  associated 

experiments),  then  applying  Eqs.  6-10  and  6-11 — for  this  case  a  linear 
programming  problem  results — we  get 

A*  =  0,  A*  =  =  .263,  A*  =  .737  (6-13) 
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(6-14) 


if  C  =  -20;  if  C  *  -25.15,  we  get 


)£  -  0  «  A*,  A*  =  1 


and  if  C  =  -26,  we  get 


C  +  25.15  ~*  „ 

X1 - 05 —  -096’  x2-0’ 


Note  that  the  range  for  effective  C  is 


A*  *  .904 


(6-15) 


-26.43  <  C  <  -5.5  ,(6-16) 

That  is,  C  should  be  greater  than  the  minimax  risk.  -26.43  and  less  than 
the  maximum  risk  under  the  unrestricted  quasi-Bayes  rule. 


This  minip^x  risk  can  be  obtained  by  using  well  established.*  tech- 

"k  k  k  k 

niques .  If  y  denotes  the  minimax  strategy  and  Y^>  Y2»  Y-j  the  probabilities 

k 

that  y  respectively  selects  0^2*  ab4>  (anc*>  °f  course,  their  cor¬ 

responding  experiments)  then  applying  these  techniques  we  get 

Y*  =  .149,  y*  =0,  Y3  =  >851  '  (6-17) 

and  the  corresponding  minimax  risk  is  -26.43,  independent  of  the  true 

A 

probability  distribution  of  0  since  y  is  an  equalizer  rule. 


For  these  rules  the  risk  r(p,d)  can  be  calculated  where  p  is  the 
actual  distribution  of  9.  In  Fig.  6-2  the  risks  for  these  rules  are 
plotted  versus  p,  the  true  probability  that  0  =  0^,.  From  the  figure  it 
is  seen  that  the  decision  maker  has  choices  ranging  from  a  minimum  pos¬ 
sible  risk  of  -$58,  with  a  possible  risk  of  -$5.5,  to  a  guaranteed  risk 
of  -$26.43.  The  compromising  behavior  of  the  restricted  quasi-Bayes  is 
evident . 
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restricted  quasi-Bayes 

minimax 

quasi-Bayes 


Figure  6-2:  Comparison  of  Risks  for  Ball  Problem 
with  $6  Observation  Cost 


D .  DISCUSSION 

The  basic  difficulty,  of  course,  is  the  possible  variation  in 
inferred  prior  with  changing  P(y | 0) .  If  the  inferred  prior,  P(0),  were 
fixed  over  the  experiments,  r(P,d)  could  be  used  as  the  sole  decision 
criterion.  However,  a  variation  in  the  prior  requires  seme  modification 
in  the  decision  making  procedure.  (We  discuss  some  of  the  anomalies 
associated  with  quasi-Bayes  in  Part  II  of  this  paper.)  We  have  elected 
to  use  the  C  restriction,  feeling  that  it  is  a  concise  and  general  way 
of  handling  the  difficulty.  In  general  any  Bayes  procedure  using  an 
(objective)  inference  technique  which  in  turn  is  a  function  of  P(y 1 9) 
will  encounter  this  difficulty  of  multiple  priors. 
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VII.  CONCLUSIONS 


The  use  of  this  approach  enables  the  solution  of  a  wide  range  of 
practical  problems  involving  uncertainty.  It  can  be  applied  to  the  group 
Bayesian  problem  (Raiffa  ,  Chapter  8) — collectively  selecting  C  may  be 
easier  than  finding  a  collective  (subjective)  prior.  In  Ref.  12  this 
procedure  is  uaad  to  find  both  open-loop  and  closed-loop  controllers  for 
dynamic  systems  having  unknown  parameters.  An  advantage  of  this  approach 
is  its  ability  to  objectively  use  the  available  prior  information. 

When  there  is  only  one  experiment,  the  application  of  our  approach 
is  straightforward  and,  as  will  be  seen  in  Part  II,  quite  justifiable. 

r 

It  is,  in  a  sense,  an  objective  alternative  to  the  minimax  procedure. 
However,  when  there  Is  a  choice  of  experiments,  our  results  are  not  as 
conclusive.  More  will  be  said  on  this  subject  in  Part  II  but  suffice 
it  to  say  that  no  alternative  decision  making  procedure  exists  which 
doesn't  have  some  serious  objections  to  it! 
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BAYES  DECISION  RULES  BASED  ON  OBJECTIVE  PRIORS 
PARI  II:  JUSTIFICATION 
By  T.  L.  Oberlin  and  R.  L.  Kashyap 


Abstract 


In  Part  I  of  this  paper  an  objective  Bayes  approach,  to  decision 
making  under  uncertainty  was  proposed.  The  priors  were  obtained  by  using 
an  objective  inference  technique  developed  by  Kashynp.  Justification 
arguments  are  now  given  for  this  approach  based  upon  axiomatic  considera¬ 
tions,  reasonableness  arguments  and  comparison  of  the  average  losses  in¬ 
curred  using  both  this  approach  and  the  minimax  approach. 

It  is  shown  that  in  situations  where  the  experimentation  is  fixed 
and  the  decision  problem  is  faced  repeatedly,  but  not  necessarily  an 
infinite  number  of  times,  this  approach  is  justifiable.  In  situations 
where  there  is  a  choice  of  experiments,  these  arguments  are  not  as  con¬ 
clusive;  however,  the  approach  still  has  practical  merit  as  an  objective 
alternative  to  the  minimax  approach. 


34 


I.  INTRODUCTION 

The  basic  approach  of  Part  I  was  to  assume  that  the  uncertainties 
present  in  the  decision  problem  could  be  characterized  by  random  variables 
which  are  then  assigned  probability  distributions. 

We  wish  now  to  give  justification  for  this  approach.  We  shall  do 
this  by  axiomatic  considerations,  reasonableness  arguments  and  comparison 
of  average  losses  incurred  using  other  approaches.  These  approaches  include 
subjective  Bayes,  empirical  Bayes  (Robbins'S  and  minimax.  Techniques  such 
as  hypothesis  testing,  confidence  intervals  and  point  estimation  will  not 
be  discussed.  Lindley  gives  an  interesting  comparison  between  some  of 
these  approaches  and  those  using  the  concept  of  a  prior  distribution. 

The  merits  of  and  objections  to  each  approach  depend  a  great  deal 
upon  the  type  of  problem  in  which  they  are  used ,  especially  the  repetitive 
nature  of  the  problem.  We  start  by  considering  a  repetitive  decision 
problem  in  which  there  is  no  choice  of  experiment  and  then  generalize  to 
the  situation  where  the  decision  maker  has  a  choice  of  experiments. 


H .  SINGLE  EXPERIMENT 

In  general  we  do  not  expect  to  show  that  any  one  decision  rule  is 
always  ''better"  than  others;  that  is,  there  will  be  situations  in  which 
a  number  of  decision  rules  are  admissible.  Naturally,  we  are  primarily 
interested  in  those  situations  in  which  the  objective  Bayes  rules  seem 
to  be  better  than  any  others,  so  we  assume  that  we  are  in  a  situation  in 
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which  the  decision  problem  is  faced  repeatedly — not  necessarily  an  infinite 
number  of  times — and  in  which  the  experimentation  is  fixed. 

In  irder  to  provide  a  basis  for  comparison  and  to  provide  insight 
into  the  nature  of  the  justification  problem,  an  axiomatic  approach  to 
selecting  a  decision  criterion  is  briefly  discussed,  and  then  arguments 
are  given  for  the  validity  of  other  approaches  to  forming  decision  rules. 
Finally,  a  direct  comparison  of  the  objective  Bayes  and  minimax  approaches 
is  made. 

A.  AXIOMATIC  APPROACH 

In  using  the  axiomatic  approach,  the  order  of  the  decision-making 
process  is  reversed  from  that  given  in  Part  I.  In  Part  I  a  decision 
criterion  is  postulated,  resulting  decision  rules  formed  and  then  the  per¬ 
formance  of  these  decision  rules  is  evaluated.  An  alternative  procedure — 
the  axiomatic  approach — is  fo  list  certain  desiderata  or  axioms  for  a 
decision  criterion  to  fulfill  and  then  see  if  the  proposed  criterion 
satisfies  these  axioms.  This  is  done  prior  to  using  the  criterion  to  find 
decision  rules.  The  dxioms  ar  posed,  naturally,  so  that  if  a  criterion 
fulfills  them  the  resulting  decision  rule:s  are  justifiable.  Not  surpris¬ 
ingly,  one  of  the  major  problems  in  applying  the  approach  is  the  selection 

of  axioms  such  that  they  are  mutually  compatible  and  also  intuitively 

3 

reasonable  (see  Bimbaum  )  . 

4 

In  Chapter  13  of  their  book  ,  Luce  and  Raiffa  give  a  very  readable 
presentation  of  the  axiomatic  approach,  and  consequently  it  will  not  be 
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repeated  here.  However,  some  conclusions  can  be  drawn.  Namely,  of  the 
four  decision  criteria  that  they  applied  to  their  list  of  axioms — minimax, 
minimax  regret,  a  pessimism-optimism  and  insufficient  reason — no  one 
criterion  dominated.  That  is,  of  these  four  decision  criteria,  none  ful¬ 
filled  all  the  axioms  in  a  completely  satisfactory  manner. 

However,  the  criterion  based  on  the  principle  of  insufficient  reason 
satisfied  the  basic  axioms  1  thru  9  in  Luce  and  Raiffa,  but  there  is 
question  as  to  when  the  criterion  is  applicable,  that  is,  the  axiomatic 
representation  of  complete  ignorance.  The  essence  of  the  argument  is  that 
if  the  decision  maker  is  "completely  ignorant"  as  to  the  state  of  nature, 
then  should  his  final  decision  rule  be  altered  by  deleting  a  state  of 
nature  (assuming  0  finite)  that  has  the  same  losses  as  another  state  for 
any  action  a  e  A?  IJ:  the  number  of  states  of  nature  is  changed,  the 
probability  assigned  to  each  state  ty  the  principle  of  insufficient  reason 
will  change,  which  can  change  the  selected  decision  rule.  Hence,  the 
argument  is  joined I 

✓ 

If  the  decision  criterion  given  by  the  objective  Bayes  approach  is 
applied  to  these  axioms,  for  fixed  experimentation  (only  one  prior  Pe(0)) 
it  can  be  shown  that  it  satisfies  the  basic  axioms  1  thru  9.  However,  it 
is  subject  to  the  same  criticism  as  the  insufficient  reason  criterion. 
Namely,  that  if  the  number  of  states  of  nature  is  changed,  the  inferred 
prior  P  (0)  will  generally  change  and  possibly  the  resulting  objective 
Bayes  decision  rule  will  be  altered. 
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It  serins  as  though  any  decision  criterion  based  upon  an  i xxerence 
reproach  which  is  independent  of  the  values  assigned  to  the  losses — as 
both  the  principle  of  insufficient  reason  and  our  objective  inference 
are — will  be  subject  to  this  criticism.  For  these  Inference  techniques, 
the  number  of  states  of  nature  must  remain  fixed  for  a  given  problem; 
if  this  number  changes,  the  problem  is  changed  and  a  new  ordering  of 
decision  rules  is  possible.  It  is  felc  that  this  failing  is  relatively 
minor  in  comparison  with  the  difficulties  associated  with  those  procedures — 
such  as  nr.  limax  or  minimax  regret — which  can  be  viewed  as  having  inference 
techniques  based  coon  the  loss  function. 

Ir.  sun. nary,  ;be  axiomatic  approach  is  useful  since  it  shows  that 
.  ore  .uis  yet  found  a  decision  criterion  which  has  all  the  desired  prop¬ 
erties  A.  at  cat  logically  be  desired. 

B.  JUSTIFICATION  OF  BAYES  APPROACH 

If  a  Bayes  approach  is  to  be  followed — that  is,  0  is  assumed  to  be 

,i  -.Hid om  variable  and  a  probability  distribution  is  assigned  to  it — two 

5 

?asxc  questions  concerning  justification  arise  : 

(i)  adequacy  >f  the  assumption  of  randomness  of  6  and  knowledge 
of  thu  urio.-  probability  distribution  of  0; 

(ii.)  i.nterp  station  of  the  minimum  expected  loss  as  optimum. 

Cbvio  isiy  tli’'  ans.v  .j  „o  (i) ,  (ii)  will  depend  on  the  probability 
distributio  as:if  u.ant  technique.  The  following  sections  will  discuss 
several  of  these  assignment  techniques,  including  Kashyap’s,  in  light  of 
(i),  Ci). 
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1.  Subjective  Bayes 


Essentially  a  subjectivist  holds  the  view  that  probability  measures 
a  person’s  degree  of  belief  as  evidenced  by  his  betting  or  action  behavior. 
DeFinetti^  demonstrates  that  if  a  person  is  consistent  in  placing  his  bets, 
his  subjective  probability  assignments  will  satisfy  the  usual  laws  of  prob¬ 
ability.  From  this  point:  of  view  then,  (i)  is  adequately  answered  by  the 
subjective  assignment  of  a  probability  distribution  to  0. 

In  order  to  answer  question  (ii) ,  the  subjectivist  views  the  loss 
function  as  a  negative  utility  in  the  sense  of  Von  Neumann-Morgenstem. 

In  Ref.  7  Savage  showed  these  concepts  of  utility  and  subjective  prob¬ 
ability  can  be  linked  together.  Now  the  optimality  of  the  minimum  expected 
loss,  where  the  expectation  is  with  respect  to  the  subjective  probability 
distribution,  follows  from  the  expected  utility  hypothesis.  Thus,  ques¬ 
tion  (ii)  is  answered. 

There  are  some  objections  to  this  approach.  One  is  the  practical 
difficulty  in  assigning  the  probability  distribution  such  that  there 
are  no  inconsistencies.  A  more  fundamental  objection  is  that  two  different 
people,  when  faced  with  the  same  decision  problem,  may  end  up  with  two 
completely  different  solutions. 

All  in  all,  however,  in  a  non-repetitive  situation — that  is,  the 
decision  problem  is  a  "one  shot"  affair — the  subjective  approach  is  hard 
to  argue  against,  even  considering  the  dif fuculties  of  consistent  assign¬ 
ment  and  nonobjectivity.  However,  in  repetitive  type  situations  where 


5 


39 


r  • 

there  ia  a  true  underlying  frequency  probability  distribution  for  0 — 
albeit  unknown — the  subjective  approach  seems  hard  to  defend-  It  has 
no  guarantee  against  the  improper  se  of  information — the  actual  average 
incurred  loss  may  differ  widely  from  the  expected  loss.  Hence,  the 
argument  for  answering  (ii) ,  while  valid  for  non- repetitive  situations, 
is  not  satisfactory  for  repetitive  situations. 

2.  Empirical  Bayes 

The  empirical  Bayes  approach  is  suitable  for  situations  in  which 

the  decision  problem  presents  itself  repeatedly  and  independently  with 

a  fixed  but  unknown  underlying  frequency  distribution  for  0.  A  good 

1  8 

summary  of  the  method  is  given  by  Robbins  .  (Maritz  also  summarizes 
the  method  and  contains  more  current  references.)  Essentially,  in  using 
the  empirical  Bayes  approach,  the  decision  maker  forms  sequential  decision 
rules  which  utilize  past  observations  to  extract  some  information  about 
the  frequency  distribution  of  0.  Under  certain  conditions  then  it  can 
be  shown  that  as  n  -*■  00  (  n  is  the  number  of  repetitions) ,  the  risk  cor¬ 
responding  to  these  decision  rules  converges  to  the  actual  Bayes  risk 
that  results  if  the  frequency  distribution  of  0  were  known.  (This  prop¬ 
erty  is  called  asymptotic  optimality  of  the  decision  rule.) 

Consequently,  in  the  limit  as  n  ->•  00 ,  (i)  and  (ii)  can  be  answered 
by  the  same  arguments  given  for  when  the  frequency  distribution  of  0  is 
known;  that  is,  by  the  law  of  large  numbers,  the  Bayes  risk  approaches 
the  actual  average  loss. 
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The  major  objection  to  this  approach  is  that  the  value .of  n  neces¬ 
sary  for  convergence  is  not  clearly  specified.  In  practical  situations 
there  may  be  only  a  limited  number  of  observations  available.  In  these 
cases,  general  results  for  the  assessment  of  the  performance  of  empirical 
Bayes  methods  have  not  yet  been  found  (p.  viii,  Ref.  8). 


3.  Objective  Bayes 

In  using  the  objective  Bayes  approach,  we  assume  that  the  problem 
is  repeated  often  enough  so  that  a  purely  subjective  approach  is  un¬ 
satisfactory.  In  general,  the  situation  can  be  characterized  as  one  in 
which  the  decision  problem  presents  itself  repeatedly  and_ independently 
whith  the  same  underlying  frequency  distribution  of  0,  but  the  number 
of  repetitions  is  insufficient  for  convergence  of  an'  adaptive  decision 
procedure  such  as  empirical  Bayes.  For  a  problem  of  this  type,  initial 
choices  for  the  unknown  probability  distributions  will  have  a  pronounced 
effect  on  the  average  loss  incurred. 


In  a  sense  then,  the, question  of  randomness  of  0  is  answered  by 
the  type  of  problem  considered — 0  is  assumed  to  have  an  unknown  frequency 
probability  distribution.  The  answer  to  the  second  part  of  question  (i) 
lies  in  the  inference  technique  used.  Since  the  actual  performance  of 
the  decision  rule  can  be  noticeably  affected  by  the  inferred  prior  prob¬ 
ability  distribution  of  8,  a  conservative  approach  is  desirable.  Kashyap' 
inference  technique  has  th.is  conservative  character.  Thus,  since  the 
true  frequency  distribution  of  0  is  unknown,  a  conservative  "estimate" 
of  it  is  used  instead. 
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Note  that  this  estimate  is  conservative  from  the  inference  point 

of  view  and  is  not  necessarily  conservative  as  far  as  the  decision  problem 

is  -concerned,  that  is,  the  actual  average  loss  incurred.  This  leads  to 

* 

question  (ii) — what  is  the  interpretation  of  the  resulting  risk  rCP£,de)? 

* 

rCP£,d£)  is  really  just  an  estimate  of  the  average  loss  that  will  be 

■k 

incurred  using  d£,  but  due  to  the  nature  of  the  problem  its  accuracy 

cannot  be  verified,  just  placed  within  limits.  Some  of  the  difficulties 
* 

in  using  r(Pe»de)  as  the  sole  measure  of  performance  are  discussed  in 
Sec.  III. 


4.  Restricted  Objective  Bayes 

If  the  decision  maker  is  not  satisfied  with  the  bounds — that  is, 

the  maximum  possible  average  loss  is  too  high — he  may  wish  to  choose  a 
it 

decision  rule  y  to  minimize  his  upper  bound,  that  is,  his  maximum  possible 

* 

average  loss.  Of  course,  y  is  simply  a  minimax  decision  rule.  However, 
this  minimax  approach  has  certain  drawbacks. 

A  compromise  between  these  two  approaches — objective  Bayes  and 

minimax — is  offered  by  the  restricted  objective  Bayes  approach.  This 

approach  has  some  of  the  advantages  of  both  the  minimax  and  unrestricted 

Bayes  decision  rules.  On  one  hand  it  allows  the  decision  maker  to  restrict 

his  maximum  possible  loss  to  some  level  C,  but  on  the  other  hand,  since 

* 

P£  is  used  in  evaluating  the  risk  that  minimizes,  it  isn  t  so  pes¬ 
simistic  Luat  it  concentrates  unduly  (as  determined  by  C)  on  the  worst 
states  of  nature.  Examples  illustrating  this  behavior  are  given  in  Part  I. 
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In  a  sense  then,  our  restricted  objective  Bayes  rule  is  a  hybrid — 
it  is  based  on  an  ob j  ective  prior  probability  and  loss  function  but  is 
affected  by  an  arbitrary  value  of  C.  However,  since  this  arbitrariness 
is  restricted  to  the  choice  of  C  at  the  "end"  of  the  problem  solution 
rather  than  in  the  beginning  as  is  the  case  with  the  subjective  approach, 
this  approach  avoids  some  of  the  criticisms  of  the  purely  subjective 
approach.  Namely,  it  is  applicable  to  situations  in  which  the  decision 
problem  is  faced  repeatedly,  and  the  effect  of  the  subjectivity  is  easily 
seen.  Ultimately  the  decision  problem  is  subjective,  and  this  seems  to 
he  a  good  place  to  account  for  it. 

Undoubtedly  there  are  those  who  will  disagree  with  some  of  the 
arguments  and  assumptions  used  above.  In  the  next  section  the  actual 
performances  of  objective  Bayes  rules  and  minimax  rules  will  be  compared 
to  provide  a  more  real  or  operational  justification  of  our  approach. 

C.  COMPARISON  WITH  MINIMAX  DECISION  RULES 

For  the  type  of  problem  being  considered — repetitive  but  not  neces¬ 
sarily  having  an  infinite  number  of  repetitions  of  the  decision  problem — 
it  is  felt  that  the  minimax  decision  rule  is  the  main  competitor  of  the 
objective  Bayes  decision  rules. 

1.  Basic  Differences 

The  minimax  approach  is  the  most  conservative  decision  making 
approach-r-it  can  be  given  a  zero-sum  game  interpretation. 
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A  comparison  with,  the  objective  Bayes  decision  rules  can  be  made 
by  viewing  these  minimax  decision  rules  as  simply  Bayes  rules  with  respect 
to  the  least  favorable  distribution  over  0.  The  difference  between  the 
two  approaches  then  lies  in  the  prior  distributions  of  0  that  are  assumed. 
In  the  objective  Bayes  approach,  Kashyap's  inference  technique  is  used, 
which  itself  is  an  interpretation  of  die  solution  of  a  two-person  zero- 
sum  game.  However,  the  essential  difference  is  that  this  inference  pro¬ 
cedure  is  independent  of  the  values  in  the  loss  function;  that  is,  P  (6) 
does  not  depend  on  the  losses  assigned  in  the  decision  problem.  However, 
as  noted  it  does  depend  on  the  form  of  the  loss  function.  This  is  not 
the  case  for  the  least  favorable  distribution,  which  obviously  depends 
upon  the  loss  function  values. 

It  is  felt  that  the  major  objection  to  minimax  decision  rules — 
their  high  sensitivity  to  greater  risk  with  low  sensitivity  to  less  risk — 
arises  because  of  this  dependence  of  the  inference  of  the  loss  function. 

By  avoiding  this  dependence,  the  objective  Bayes  decision  rules  are  not 
as  subject  to  this  criticism.  The  following  example  will  illustrate  this. 

Consider  the  urn  problem  described  in  Part  I  and  the  single  experi- 

ment  e  . 

- ul 


If  y  denotes  the  minimax  decision  rule  for  this  problem,  where 

ft  ft  ft 

Y  selects  and  with  probabilities  Yp  Y2>  respectively,  then 


Yi 


.524 


Y2  =  .476 


(2-1) 
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The  corresponding  minimax  risk  is 


r(PW,Y*)  =  -23.43 


(2-2) 


The  objective  Bayes  decision  rule  for  experiment  e  ^  is 


d  =  a  _ 
e  .  u5 
ul 


(2-3) 


with  corresponding  risk 


r(Pe  ,d  )  =  -47 
ul  ul 


(2-4) 


,  w  *  * 

Comparison  of  r(P  ,Y  )  and  r(P  ,d  )  is  not  too  meaningful  since  these 

eul  eul 

risks  are  evaluated  at  different  probability  distributions  for  0.  A  more 
meaningful  comparison  is  to  consider  their  respective  risks  evaluated 
with  the  true  probability  of  0^,p.  These  risks  can  easily  be  evaluated. 


r(p,d  )  =  66p  -  80 
ul 


r(p,Y  )  =  "23.43 


(2-5) 


They  are  plotted  versus  p  in  Fig.  2-1. 


Notice  that  if  p  <  .857,  the  risk  incurred  using  the  objective 
Bayes  rule  is  less  than  that  incurred  using  the  minimax  rule.  In  fact, 
the  objective  Bayes  rule  can  be  almost  four  times  less  than  the  minimax 
risk  (-80  versus  -23.43  when  6  =  0^).  On  the  other  hand,  the  objective 
Bayes  risk  can  go  as  high  as  -14  versus  the  -23.43  bound  on  the  minimax 
risk . 
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Y  :  minimax 

* 

d  :  objective  Bayes 
eul 
* 

X  :  restricted  objective 
Bayes 


Figure  2-1:  Comparison  of  Risks  for  Urn  Problem 
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This  then  demonstrates  the  sensitivity  of  the  minimax  approach — 
highly  sensitive  to  greater  risks,  much  less  sensitive  to  lesser  risks. 
This  is  the  major  criticism  of  the  minimax  approach. 

However,  if  the  decision  maker  ohjects  to  the  possibility  of  an 
average  loss  of  -$14,  he  can  use  a  restricted  objective  Bayes  rule  to 
reduce  this  maximum  risk.  Let  C  denote  the  desired  maximum  possible 
risk.  Then  the  restricted  objective  Bayes  rule  A  for  experiment  e  -  is 

C  -f  14  ,*  32  +  C 

1  18  »  A2  =  18  (2-6) 

,  ^ 
where  -23.43  £  C  -14  and  A^,  A^  represent  the  probabilities  that  A 

selects  and  respectively. 

—  * 

For  C  =  -20,  the  risk  r(p,A  )  is  given  by 

r(p,A*)  =  24p  -  44  (2-7) 

This  is  plotted  in  Fig.  2-1.  Notice  that  in  exchange  for  a  smaller  maxi¬ 
mum  risk,  the  minimum  possible  risk  is  reduced.  As  C  approaches  -23.43, • 
these  upper  and  lower  bounds  converge  and  the  restricted  objective  Bayes 
rule  approaches  the  minimax  rule.  Thus,  the  restricted  objective  Bayes 
rule  can  be  viewed  as  a  sort  of  compromise  between  the  objective  Bayes 
and  minimax  decision  rules. 

Another  consideration  for  use  in  the  comparison  of  objective  Bayes 
and  minimax  decision  rules  is  the  use  of  sampling  information.  There 
are  various  ways  for  the  objective  Bayes  approach  to  use  this  information, 
essentially  by  the  use  of  judicious  approximations. 
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However,  a  true  minimaxer  can’t  really  make  even  these  approximation! 
he  is  only  concerned  with  the  possibility  that  P(0)  can  assume  its  least 
favorahle  value  and  not  the  probability  that  it  does. 

9 

Radner  and  Marshak  give  a  good  example  illustrating  some  of  the 
objections  to  using  minimax  rules  when  sampling  is  allowed.  They  essen¬ 
tially  have  found  a  situation  in  which  even  though  P(0)  is  unknown,  the 
optimum  minimax  decision  rule  calls  for  no  sampling,  no  matter  how  cheaply 
it  may  be  obtained.  The  problem  is  that  as  long  as  it  is  possible  for 
P(0)  to  be  least  favorable,  the  minimax  approach  will  assume  it  is  so. 

Thus,  in  summary,  the  attitude  of  the  minimax  approach  makes  it 
difficult  to  use  "uncertain"  information. 

2.  Variations  in  the  Loss  Function 

So  far  we  have  assumed  that  the  loss  function  is  a  known,  scalar 
performance  index.  However,  if  we  allow  unknown,  random  terms  in  the 
loss  function,  the  sensitivity  of  the  minimax  approach  can  be  considerably 
increased  relative  to  the  objective  Bayes  approach. 

Another  shortcoming  in  the  minimax  approach  arises  when  multiple 
losses  are  considered;  that  is,  the  performance  index  may  be  given  by 
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The  objective  Bayes  approach  is  suitable  for  application  to  decision 
problems  having  performance  indices  of  this  form  due  to  the  independence 
of  the  inference  from  the  loss  function.  The  minimax  approach,  on  the 
other  hand,  isn't  really  suitable  for  these  types  of  performance  indices 
since  evaluation  of  either  Eqs.  2-8  or  2-9  requires  a  probability  dis¬ 
tribution  for  0.  However,  for  these  performance  indices,  the  minimax 
approach  doesn't  indicate  a  way  to  find  this  probability  distribution. 


D.  SUMMARY 

This  section  has  been  concerned  with  the  justification,  for  fi:  id 
experimentation ,  of  the  objective  ?ayes  approach.  Part  B  contained  reason¬ 
ableness  arguments — what  properties  should  a  decision  criterion  possess; 
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what  interpretation  can  be  given  to  the  resulting  expected  losses?  In 
this  section  a  more  practical  approach  \-as  '.aken — is  there  another  ob¬ 
jective  approach  that  does  as  well  or  better  than  the  objective  Bayes 
approach? 

Based  on  the  answers  to  these  questions  and  the  number  of  situations 
in  which  the  objective  Bayes  approach  is  better  than  the  minimax  approach, 
we  feel  the  objective  Bayes  is  a  justifiable  approach  for  fixed  experi¬ 
mentation  .  In  the  next  section  the  question  of  its  justification  when 
there  are  several  priors  (that  is,  several  different  experiments)  in  the 
decision  problem  is  considered. 

III.  MULTIPLE  EXPERIMENTS 

Assume  that  there  are  a  finite  number  of  experiments  from  which  to 
choose.  The  basic  difficulty  confronting  us  is  the  possible  variation  in 
priors  with  experiment.  (P(y|0)  is  fixed  throughout  an  experiment,  but 
may  vary  between  experiments  and  consequently  so  can  the  objectively 
inferred  prior.) 

To  attack  this  problem,  we  formulated  the  quasi-Bayes  approach. 

We  are  now  interested  in  the  validity  of  this  approach.  We  shall  use 
much  the  same  procedure  as  in  Sec.  II — axiomatic  considerations,  reason¬ 
ableness  arguments  and  direct  comparison  with  the  minimax  approach. 
However,  in  order  to  illustrate  some  of  the  properties  of  the  quasi- 
Bayes  approach,  we  shall  first  Introduce  a  new  approach — the  standard 
approach — and  compare  the  two. 
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A.  ALTERNATE  APPROACH  TO  SELECTION  OF  EXPERIMENT 


1.  Standard  Approach. 

A  typical  way  of  selecting  an  experiment  and  ultimately  the  "optimal" 
decision  rule  is  to  find  the  best  decision  rule  for  each  experiment  and 
then  select  the  experiment  whose  associated  best  decision  'rule  has  the 
minimum  risk.  We  shall  call  this  approach  the  standard  approach. 

We  can  formalize  this  approach  as  follows.  For  each  experiment 

e^  c  E,  i=l,2,...,n,  let  the  associated  objective  Bayes  decision  rule  be 
* 

denoted  by  A  .  That  is, 
ei 

A*  =  Arg  Min  r(P  ,A  )  (3-1) 

ei  A  ei  ei 

ei 

where  P  is  the  objective  prior  probability  density  for  experiment  e. 
ei 

found  using  Kashyap's  inference  technique. 

In  addition  there  may  be  restrictions  of  the  form 

R(9,A*  )  <  Cs  ,  ¥  0  e  0  ,  i»l,2 . n  (3-2) 

on  these  decision  rules. 

Now  let  6  be  a  randomized  rule  for  selecting  the  experiment,  where 

6  ,  i=l,2,...,n,  correspond  to  the  probability  that  experiment  e.  is 
i  * 

selected.  If 

r(<5)  =  <5^r (Pg  ,A*  )  +  •  *  *  +  ) 

1  1  “n  n 
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then  following  the  standard  approach.,  the  rule  for  selecting  the  experi- 
ment  is  given  hy  q  where 

1  6  *  Arg  Min  r(6)  (3-3) 

6 

(Wo  have  introduced  this  notation  to  facilitate  comparison  with  the  quasi- 

Bayes  approach.  Obviously  <5  will  be  a  degenerate  probability  distribution 

* 

which  simply  selects  the  experiment  having  minimum  r(Pe,Ae>.) 

2.  Comparison  of  Approaches 

The  basic  difference  between  the  two  approaches  is  that  with  the 
standard  approach  an  experiment  is  selected  only  on  the  performance  of 
its  "best"  decision  rule  whereas  with  the  quasi-Bayes  approach  an  experi¬ 
ment  can  be  selected  On  the  performance  of  any  decision  rule  leading  to 
an  admissible  strategy.  (A  strategy  is  any  decision  rule-experiment 
combination.)  There  are  two  major  consequences  of  this  difference  that 
are  of  interest  to  us:  the  admissibility  of  the  experiment  and  consequent 
decision  rule  which  are  chosen,  and  the  difference  in  ranges  of  C„  and 

'  O 

c. 

If  the  inferred  prior  Pe(0)  Is  the  same  for  all  e  e  E,  then  the 
experiment'  and  consequent  decision  rule  selected  by  the  standard  approach 
will  be  admissible  relative  to  the  other  possible  experiment-decision 
rule  combinations.  However,  if  the  inferred  prior  Pg(0)  varies  with  the 
experiment,  then  it  is  quite  possible  that  the  experiment-decision  rule 
selected  by  the  standard  approach  will  be  inadmissible  relative  to  the 
other  strategies  Using  the  quasi-Bayes  approach,  it  Is  easy  to  avoid 
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this  by  initially  considering  only  admissible  strategies;  that  is,  we 

restrict  p  to  admissible  strategies  and  admissible  mixtures  of  these 

* 

strategies.  Consequently,  the  resulting  p  is  admissible. 

Another  difference  between  the  approaches  is  the  ranges  of  C,  C^. 
Specifically,  It  is  possible  for 

min  (C)  <  rain  (C^)  (3-4) 

where  min  (-)  means  that  if  C  or  Cg  is  lers  than  this  value,  no  solutions 
exist  which  satisfy  the  constraint. 

It  is  easy  to  see  this  from  a  graphical  point  of  view.  Assume 
E  a  {eQ,e^}  and  that  there  are  three  admissible  strategies  for  each 
experiment — 0^,0.^,  i=l,2,3 — having  conditional  risks  R(0,o)  as  plotted 
in  Fig.  3-1.  Only  the  lower  left  hand  boundaries  of  the  risk  sets  are 
drawn. 

From  Fig.  3-1  we  see  that  if  C  =  C,  we  can  obtain  a  rainimax  risk 
of  C  using  the  restricted  quasi-Bayes  approach  with  C  =  £.  However, 
with  the  standard  approach,  if  Cg  <  C  +  A,  Fig.  3-1  shows  that  no  solu- 

•k  * 

tions  A  ,  A  exist  which  satisfy  constraint  (3-2). 
o  1 

Moreover,  using  the  quasi-Bayes  approach  with 

C<  C<C  +  4  (3-5) 
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we  can  obtain  restricted  quasi-Bayes  solutions  whereas  no  solutions  exist 
for  corresponding  values  of  Cg  using  the  standard  approach-.  (Note  that 
this  is  ture  regardless  of  the  priors  assigned  to  0.) 

—  —  * 

If  C,  Cc  are  so  large  as  to  be  ineffective,  then  A  ,  i-l,2,...n, 
b  ei 

will  be  nonrandom  decision  rules  and  the  standard  approach  will  yield  the 

* 

decision  rule  and  experiment  having  minimum  r(P  ,A  ) .  Assuming  that 

ei  ei 

this  experiment-decision  rule  is  an  admissible  strategy,  the  quasi-Bayes 
approach,  will  yield  the  same  decision  rule  and  experiment. 

Now  if  C,  Cg  are  of  such  values  that  they  are  effective,  it  is 
hard  to  compare  the  resulting  selections.  The  standard  approach  will 
select  an  experiment  and  then  randomize  over  possible  decision  rules 
for  this  experiment.  The  quasi-Bayes  approach  will  randomize  over  both 
experiment  and  decision  rule;  in  fact,  when  a  strategy  is  selected,  the 
decision  rule  to  be  used  in  the  experiment  is  specified.  Consequently, 
following  the  quasi-Bayes  approach,  an  experiment  may  be  selected  and 
performed  and  a  decision  rule  used  which  might  not  be  optimal  if  that 
experiment  along  were  being  considered.  Thus,  if  C,  Cg  are  such  that 
they  are  effective,  the  standard  approach  will  yield  decision  rules  which 
are  consistent  within  the  chosen  experiment,  and  the  quasi-Bayes  approach 
can  yield  decision  rules  which,  while  not  necessarily  the  best  for  the 
given  experiment,  are  optimal,  in  the  sense  defined,  over  the  set  of 
experiments . 


21 


Note  that  when  we  say  the  decision  problem  is  faced  repeatedly  we 
wean  that  the  selection  of  both  the  experiment  and  the  decision  ru2e  is 
faced  repeatedly.  Consequently,  we  feel  that  the  quasi-Bayes  approach 
is  a  "better"  approach  than  the  standard  approach. 

B.  AXIOMATIC  CONSIDERATIONS 

The  axiomatic  approach  was  briefly  discussed  in  Sec.  II.  It  was 

stated  there  that  the  axioms  presented  by  Luce  and  Raiffa  were  satisfied 

> 

by  the  objective  Bayes  criterion,  with  the  exception  that  the  ordering 
is  altered  if  the  number  of  states  is  changed  by.  deleting  a  state  having 
the  same  losses  as  another  state  for  any  action  a  c  A.  Unfortunately, 
the  extension  of  the  objective  Bayes — the  quasi-Bayes  approach — does  not 
satisfy  all  the  basic  axioms. 

For  example,  admissibility  of  the  optimal  decision  rula  Is  a  basic 
axiom,  and  we  can  only  obtain  it  by  restricting  p  to  admissible  strategies 
and  admissible  mixtures  of  strategies.  Without  this  restriction  the  final 
choice  of  experiment-decision  rule  might  not  be  admissible. 

This  restriction  can  cause  some  other  discrepancies,  however.  For 
ejfcample,  if  two  strategies  are  optimal,  that  is,  quasi-Bayes  decision  rule 
then  a  mixture  p  of  them  may  have  the  minimum  r(T,p),  but  this  mixture 
may  be  inadmissible  and  hence  non-optimal.  In  other  words,  the  optimal 
set  of  strategies  is  not  necessarily  convex. 
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There  are  other  discrepancies.  If  we  add  a  constant  to  a  row  of 
the  matrix  (assuming  finite  0,A),  we  can  change  the  ordering  between 
strategies.  For  example,  suppose  $100  is  added  to  the  second  row  of 
the  loss  matrix  for  the  ball  problem.  Table  5-1,  Part  I.  Then  the  priors 

P  »  P  remain  unchanged  but  tne  risks  r(P  ,0,  „) ,  r(P  A,)  go 
%o  ®bo  ®bl 

from  -29.91  and  -30.80  to  3.34  and  4.50,  respectively.  Hence,  the  (un¬ 
restricted)  quasi-Bayes  rule  is  changed  from  a,  to  a  ^  by  adding  a 
constant  to  a  row  of  the  loss  matrix. 

While  the  failure  of  the  quasi-Bayes  approach  to  satisfy  these 
axioms  is  discouraging,  it  should  not  preclude  the  use  of  this  approach, 
since,  as  mentioned  previously,  no  one  decision  criterion  has  been  found 
which  has  all  the  desired  properties  as  expressed  by  the  axioms.  In 
view  of  this,  the  ultimate  justification  of  this  approach  is  how  well  it 
performs  relative  to  the  alternative  approaches.  (It  should  be  noted 
that  the  standard  approach  exhibits  these  same  faults.) 

C.  REASONABLENESS  ARGUMENTS 

In  Sec.  II  arguments  were  presented  for  the  reasonableness  of  certain 
Bayes  procedures  for  different  situations,  Similar  arguments  can  be  given 
for  using  the  quasi-Bayes  approach  to  select  the  best  experiment.  The 
repetitive  nature  of  the  problem  is  the  same  (except  now,  of  course,  the 
decision  maker  repetitively  selects  his  experiment)  and  so  conservative 
estimates  of  the  prior  probability  of  6  for  each  experiment  are  desirable. 
Again,  the  interpretation  of  the  resulting  risk  is  arbitrary;  the  risk 
can  be  bounded,  but  otherwise  no  properties  such  as  convergence  to  the 
actual  average  loss  can  be  claimed. 
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In  Part  I,  Sec.  VI,  an  example  was  solved  using  both  quasi-Bayes 
and  minimax  decision  rules.  From  Fig.  6-2,  Part  I,  it  can  be  seen  then 
that  the  quasi-Bayes  approach  is  a  viable  one  (at  least  for  this  particular 
problem).  A  comparison  between  the  two  approaches  was  made  for  another 
problem-control  of  a  dynamic  system  with  unknown  parameters^ — with 
much  the  same  type  results. 

In  general  it  is  felt  that  often  the  quasi-Bayes  approach  will 
yield  experiment-decision  rule  choices  whose  performance  is  quite  satis¬ 
factory.  From  a  practical  point  of  view,  this  fact  in  itself  is  sufficient 
to  justify  its  use. 


IV.  CONCLUSIONS 

Well  now,  where  do  we  stand  with  respect  to  justification  of  our 
approach  to  decision  making  under  uncertainty?  We  have  looked  at  alter¬ 
nate  approaches  in  terms  of  axiomatic  considerations,  reasonableness 
arguments  and  comparison  of  average  losses  incurred.  The  conclusion  that 
we  now  reach  is  that  the  objective  Bayes  (and  its  generalization  the 
quasi-Bayes)  approach  is  a  good  practical  approach  to  this  problem. 

There  are  inconsistencies  in  this  approach;  however,  the  majority 
of  them  only  occur  when  the  inferred  prior  varies  with  the  experiment. 

If  the  inferred  prior  is  the  same  for  all  experiments  under  consideration, 
the  justification  arguments  presented  in  Sec.  I  for  the  case  of  a  single 
experiment  apply  to  the  quasi-Bayes  approach. 
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